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: 1 The Main Result 

in 
in , 

Consider the maximal operator 



Cdf(x) = sup 

deg(p)=d 



p.v. / f(x-y)e(p(y)) — 

y 



and the integral is understood in the principal value sense. This definition is motivated 
principally by the case a = 1. Ci/ controls the maximal partial Fourier integrals of / and it 
extends to a bounded map from LP into itself for 1 < p < oo. The critical contribution here 
is L. Carleson's proof [1] of the boundedness of C\ from L 2 into weak-L 2 . The L p version 
was established by R. Hunt [3]. Also see [2, 5]. 

It is natural to ask if the same results hold for larger values of d. Indeed, it does for the case 
of d — 2 and this is the main result of our paper. 

1.1. Theorem. C 2 extends to a bounded map from LP into itself for all 1 < p < oo. 



"This work has been supported by an NSF grant, DMS-9706884. 
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To prove the theorem, it suffices to show that C 2 maps L 2 into weak L 2 as our proof can be 
modified to treat all 1 < p < oo, and we briefly indicate how to do this in the next section. 



E. M. Stein [10] has observed that the Fourier transform of the distribution e(y 2 )/y has 
an easily calculable form, one that permits analysis of the maximal operator formed from 
dilations of this distribution. We relie very much on this observation. As well, we now have a 
much richer understanding of Carleson's theorem as presented in papers of Fefferman, Lacey 
and Thiele and Thiele [2, 5, 11] and some related issues [7, 8, 6]. We invoke some of these 
elements to provide a proof of our main theorem. Stein's argument and the overview of our 
proof are laid out in the next section. The main inequality described in that section requires 
a careful analysis in time and frequency variables, that being carried out in the remaining 
sections of the paper. 

The elegant results of K. Oskolkov [9] are of the same genre as ours. 

T. Tao and J. Wright informed me of this problem. G. Mockenhoupt brought Stein's article 
[10] to my attention. Part of this work was completed at the Centre for Mathematics and 
its Applications at the Australian National University. I am endebited to all. 

Notations: The Fourier transform is taken to be /(£) := J f(x)e(—2x^) dx. The norm 
of an operator T from LP into LP is written as ||T|| p ^ p with a corresponding notation for 
the weak type norm. By A -< B we mean that there is an absolute constant K so that 
A < KB. By A ~ B we mean A -< B and B -< A. c(J) is the center of the interval of J. 



The supremum we wish to bound admits a description as a supremum over dilations, for 
which there are a wealth of techniques to use, and a supremum over modulations in frequency, 
which is the domain of Carleson's theorem. It is useful to formalize these aspects with a 
couple of definitions. 

A distribution K(y) determines two maximal functions of interest to us. They are 



x.j(x) = (l + distfo J)!^- 1 )- 1 . 



2 The Overview of the Proof 
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C[K]f(x) := sup\V[K](e(b-)f)(x) 

6GK 



: SUP SUp 

feGM a>0 



J fix - y)e{by)a 1 K(a 1 y) dy 



Thus if K(y) = y 1 l{o<|y|<i}, is the maximal truncations of the Hilbert transform and 

C[K] is Carleson's maximal operator. Set K(y) := e(y 2 /A)/y. To prove our theorem we show 
that C[K] maps L 2 into L 2 '°°. 

We recall Stein's argument [10] that T>[K] maps L 2 into L 2 . The Fourier transform of K is 
a smooth odd function satisfying 

(2.1) K(£) = c + e(£ 2 ){ Cl /£ + c 2 /i 2 + ■■■} as £ - oo 

for some choice of constants c,, j > 1. Indeed dK(£) = J e(x 2 /4— x£) dx = ce(£ 2 ). Moreover 
K is odd as K is odd hence 

c-^(0 = / e(y 2 ) dy = / e(y 2 ) dy - / e(y 2 ) dy. 
Jo Jo Jf 



And the assertation follows as 



/oo 
e(y 2 )dy^e(e){c' 1 /C + c' 2 /C + ---} 



as £ — > oo 



With (2.1) established we can write K = H + Y^L 1 2 ■'fhj where H is smooth odd and equals 
Cosign(^) for |£| > 2. The multipliers fhj are of the form 

(2.2) m,-(0 = V(2- j 0e(C 2 ) 

where ip is a C°° function with support in | < |£| < 2. These multipliers are our main 
concern. 

Now, the term arising from H is governed by the Hilbert transform. In particular T>[H] is 
bounded from L 2 into L 2 as this is the maximal truncations of the Hilbert transform. In a 
like manner we have the estimate | \C [H] || 2^2,00 d 1 by Carleson's Theorem [1]. 

Stein has shown that ||P[mj]||2->2 di 2 J ' 2 , which then completes the proof of the bound on 
T>[K]. To prove our main theorem, we demonstrate that 

(2.3) ||C[mj] || 2^2,00 =< 2 7j for some < 7 < 1. 
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In fact 7 = 8/9 will work. [By optimizing our argument we could establish this estimate for 
any 7 > 1/2.] 



Stein's argument is crucial to our own and so we recall it here. To bound ||D[mj] || 2 ^ 2 it 
suffices to prove that 



(2.4) 



sup 

Ka<2 



-< 2^ 2 



m j>a (y)f(x - y) dy 

where mj A (y) := a~ 1 rrij(a~ 1 y). And to this end the method of TT* is invoked. Observe that 
(2.5) 



sup \m j:a * m jib (y)\ ^<S>j(y), y e R, 

l<a,b<2 



(2.6) 



where $j (y) 



2^\y\ 



-1/2 



\y\ < c2 j 



2 j (l + 2 j \y\)~ 2 \y\ > 

We can take c = 16. Note that % is non decreasing and f $j dy ^ 2 J which proves (2.4). 
After taking dilation into account, (2.5) amounts to the estimate 



(2-7) 



sup 

0<fe<4 



J ^(2"''Oe(&e 2 + 2£y) df| ^ ^(y) 



Here ^ is another Schwarz function with support in | < |£| < 2. Set p(£) = &£ 2 + 2£y. If 
|y| > c&2-? observe that the derivative of p with respect to £ exceeds c|y| on the support of 
■0(2 _J £). Thus repeated integration by parts will prove the estimate. If \y\ < c62- ? we can 
use the van der Corput second derivative test. It provides the estimate of the integral as 
r< b- 1 / 2 di 2 j / 2 \y\- 1 / 2 . Thus the inequality holds. 



The reminder of the paper is devoted to a proof of (2.3). 

To do so we use the time frequency analysis of Lacey-Thiele [5] with some further ideas 
drawn from Fefferman and Thiele [2, 11]. A central conceptual problem arises from the fact 
that rhj is supported in an interval of length 2 J but mj has (approximate) spatial support in 
an interval of the same length. That is classical Fourier uncertainty is not observed. Treating 
this issue is probably the main novelty of this paper. 

For our subsequent use observe these points. First in the definition of (2.2) we can assume 
that ip is supported in 1 — ^ < £ < 1 + ^ (as opposed to \ < |£| < 2). 
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Second (2.7) implies that J\rrij\ dy ^ 2 j hence ||P[raj] Hoodoo ^ 2 J so that by interpolation 
(2.8) \\V[ m] ]\\ p ^ p r< 2 j{1 - 1/p \ 2 < p < oo. 

Thus 2? [if] maps L p into itself for 2 < p < oo. 

Indeed, this estimate holds for 1 < p < oo. In fact, we have the estimate ||P[mj] ||oo^oo ^ 2 J , 
with the same estimate holding at p = 1 as well. These estimates require no cancellation, 
and so hold for C[rrij\ as well. Thus to prove our main theorem and in light of the extension 
of Hunt of Carleson's theorem, it suffices to provide the bound we have claimed for C[rrij] on 
L 2 . 



Third there is a sharper form of Stein's observation. Namely the operator 
(2.9) D j f(x) := sup sup / a^m^a' 1 ^ - y))e(Ny)f(y) dy 

l<a<2 \N\<23 J 

maps L 2 into L 2 with norm bounded by ^ 2^ 2 . Employing the same arguments as above, 
this amounts to the estimate 



sup sup 

0<6<4 |AT|<23+2 



/ 



^(2-^)e(&£ 2 + (N + y)0 dy ^(y) 



where rj (y) 



2 i/2 



y\ 



-1/2 



2 J '(i + 2 J '|y| 



\y\ < c2 j 
\y\ > c2 j 



In this definition, n is an arbitrary positive integer and c = 32. Details are a modification 
of the earlier argument. In fact we have ||-Dj|| p ^ 2- ? ( 1 ~ 1 / p ) for 2 < p < oo. We shall have 
recourse to this below. 

Fourth in proving the estimate (2.4) we follow the approach of Kolomogorov and Silverstoff, 
as Fefferman [2] has demonstrated that this is a powerful technique in issues related to 
Carleson's theorem. We show that there is a < 7 < 1 so that for all j, measurable 
functions N : R -> R, t : R -> Z and a : R -> [1, 1 + ±], 



(2.10) 



m jMx)2 tM(x - y)f(y)e(N(x)y) dy 



~< 2 13 



2,00 



We will do this with 7 = 8/9. This inequality is sufficient for our purposes. 
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Figure 1: A fat tile s and thin tiles a € thin(s). 

3 The Discrete Operator 

Let D be a collection of dyadic intervals in the real line. Let P fat be the set of rectangles 
s = I s xuj s E D x D which have area |/ s ||^ s | = 2 2j \ We call these "fat tiles" and we generically 
write s, s', s" for fat tiles. Let oj s \ (cj s2 ) be the left (right) half of uj s . This definition is chosen 
in accordance with the frequency and spatial localizations of the kernel rrij, its dilates and 
modulations. 

Let Pthin be the set of rectangles s G D x D of area 1. We call these "thin tiles" and we 
generically write a, a', a" for thin tiles. Set thin(s) := {a G Pthin : la = Is, w a C fu^}- For 
a G thin(s), set u a j := u S j for j = 1,2. See figure 1. [Actually, "thin tiles" obey classical 
Fourier uncertainty and so are thin only in contrast to fat tiles.] 

Fix a Schwarz function 6 with L l j_i < 4> < L_j_ For a rectangle a = l a x u a of area 

L 80 1 80 J L 4() ' 40 J 

1 (not necessarily a thin tile) define 

In this display and throughout, c(J) is the center of the interval J. 

Fix the data j > 1, / G L 2 of norm one, functions N, £ and a as in (2.10). For s G Pf a t, 
a G thin(s) and integer / with 2~ l = |c<j s2 |. Define 

E(s) = E{a) := {x G R : N(x) G u al , £{x) = I}, 
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<p a (x) = l E (*)(x) J MyH-^N(x)y)a(x)- l 2- l - 3 m 3 (a(x)- 1 2- l - j (x - y)) dy, 

sePf a t o-ethin(s) 

A principal motivation for these definitions is the proof of lemma 4.3 below. At this point 
we simply observe that the support of the integral in the definition of ip a is in E(a). rhj is 
supported in a small neighborhood of 2 j so that the second function in the last integral has 
frequency support in a small interval around 2~~ l . <p a is supported in a small interval around 
c(cu a ) with uj a C \uj a 2- So N(x) must be in u s i in order for the integral to be non zero. 

We claim that the following inequality is sufficient for (2.10). 

(3-1) M/ll^ :< 2* 7 = 8/9. 

In the proof of this inequality, we only consider sums over finite subsets Sf at C Pf a t- We fix 
data / e L 2 of norm one and the functions N, i and a. Let Mj be the sum restricted to this 
new smaller class of tiles. Then, by dilation invariance, (3.1) is implied by this inequality. 

(3.2) \{Mjf > 1}\ * 2 2 ^, 7 = 8/9, 

the inequality holding for all functions / of norm one. 



Proof of sufficency of (3.1) . A convexity argument can be used to show that (3.1) implies 
the inequality (2.3). Indeed arguments like this have been used many times in related papers, 
for instance [2, 5]. 

Let us give the convexity argument in an elemental form. For our subsequent use, let us 
define translation and modulation operators by Tr t f = f(x — t) and Mod t f(x) = e~ lxt f(x) 
for t e R. Observe that the sum 

^(/, Tr n (f)}Tr n (f) 

could bewritten as a sum over tiles. More importantly, 

pl poo 

/ , £{f,Tr n+t <j>)Tr n+t <j>dt= / (j \Tr t <j>)Tr t <j> dt 

= lf)*f 
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where ip(x) = f (f>(y)(f>(x + y) dy. Recall that we specified to be a Schwartz function with 
j_i < 6 < l r l so that ib satisfies a similar set of inequalities. 

y 80 ' 80 J y 40 ' 40 J 

Elaborating on this theme, observe that this sum 

(3.3) (f,Mod m Tr n <p)Mod m Tr n( p 

could be written as a sum over tiles. Define 

-1 r i 



A f = / ^2 (f,Mod m+T Tr n+t <f))Mod m+T Tr n+t <f) drdt 
Jo Jo cZ 

(/, Mod T Tr t (p)Mod T Tr t (j) drdt 



J0 m,n<L, 
— oo r — oo 



oo J — oo 



This is a multiple of the identity, as is easy to see. 
By periodicity, Af is also equal to 



(3.4) Af= lim lim(rt)- 1 / / V (f \Mod m + T Tr n ^)Mod m + T Tr n +^ drdt 
x^oot^oo J Q J Q ^ 

This concludes our general remarks on the use of convexity. 
Let us turn to the operator Mj. Define, for an integer / 

p w/= E E </><m^> 

sGPf a t o-Gthin(s) 
|^2|=2- ; 

and observe that this sum is similar to (3.3). We may average these operators over modula- 
tions and translations to obtain a multiple of the identity. This can be done in a way that 
is independent of I e Z and essentially independent of j > 1. We shall return to this point 
momentarily. 

To make the connection with our operator Mj more directly, observe that with the notation 
used in the definition of Mj, 

M jf( x ) = (Mod 2N ( x )Dil 2 i ix) - ja{x) m j ) * P jAx) f(x) 



8 



where Dilgg(x) = 5 1 g(x5 1 ). 

Thus the main point is that we can recover the identity operator from Pj i in a way that is 
independent of / and j and does not effect the assumed inequality (3.1). 

But certainly translation and modulation do not efect the distributional inequality. And, we 
can obtain the identity operator from the Pjj in this way. Recall that the tiles depend upon 
choices of dyadic grids D and D'. A translation of D (D') corresponds to an application of 
Tr t (Mod T ) to the functions (j) a . Thus the assumed inequality applies to any Mj obtained 
from translations of either grid. Finally, the periodicity property (3.4) shows that the identity 
operator can be obtained in a way that is independent of /. This completes the proof. 

□ 



4 Trees and size 



The principle definitions and lemmas are stated in this section. We show how they prove 
(3.2) and prove the Lemmas in the following section. We begin with requisite definitions. 

For s, s' G Pf a t say that s < s f iff I s C I s i and oj s D uj s i. Say that T fat C Pf at is a tree if there 
is a J Tfat x u Tm G P fat with s < J Tfat x u Tm for all s G T fat . 

A subset T thin C P t hin is a tree if it is a subset of thin(T fat ) for some tree T fat C Pf at . 
We denote the top of the tree by Ir thin x w T thin - A tree T t hi n is a 1-tree (2-tree) iff for all 
a, a' G Tthin either u a = cu a r or uj a fl cu a i = (uj a fl u a > ^ 0) . Note that if the scales of T t hi n 
differ by a factor of 2 2j [That is, if \I a \ < then 2 2j \I a \ < \I a i\ for all a, a'] then a tree 
Tthin can be uniquely decomposed as a union of a 1-tree and a 2-tree. [We also remark that 
these definitions play a role that is parallel to the notions of a tree in [5].] 

For Sf at C Pf a t, define the "size of S iat " to be 

size(Sfat) := sup |/T thin | _1 ^ K/.^>n 
r thinC thin(5 fat ) CTeTtMn J 

where the supremum is formed over all 1 -trees T t hi n C thin(S'fat) := lJse5^^ n ( s )- ^ ne 
central lemma concerning size is 
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4.1. Lemma. A finite collection <Sf at C Pf at is a union of collections Sf at (n) ; n e Z /or 
which size(S{ a x(n)) < j2 n and 

(4.2) £ |/ s |^2- 2n , 

se5 fat (n)* 

where Sf at (n)* consists of the maximal s G Sf at (n). 

Observe that j (that is a measure of how fat the tiles are) enters into this lemma, albeit in 
a weak fashion. 

For C Pthin set 

M / thin = E 

If S'fet C Pfat, define M? fat to be Mj hm ^ Slllt \ Concerning trees, our central lemma is 
4.3. Lemma. For all trees T fat7 

(4.4) \\Mj^\\ p r< ^(Tfat);^^- 5 ^!/^! 1 /? 2 < p < oo. 

(4.5) |Mf-(x)| ^ 2 2 ^^e(T fat ) X/Tfat (a:) m , x £ 2/ Tfat , m > 1, 

where Xj( x ) = (1 + dist(x, J)| J| -1 ) -1 . 

Notice that the first estimate should be compared to Stein's estimate for P[Mj], and is only 
slightly worse than that estimate if p = 2. That the (large) factor of 2 2j enters into the 
second estimate is completely harmless. 

Set e = (200)- 1 , p = 9/4, /i = 7/9 and 7 = 8/9. 

For n > — 7j, we in essence relie upon the fact that Mj !at ^ is supported on a set of small 
measure. To make this precise, let E n = U s es fat (n)* 2 £J / S . And set F = |J n> _ 7J - E n . This set 
has measure 

1^0 1 < E \ En \ - Yl 2£j ~ 2n - 227i - 
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We do not need to estimate Mj on this set. Using (4.5) we see that 

Bringing these estimates together, we see that for the collection S iat = \J n> _ tM j S iat (n), we 
have |{Mf fat > 1}| r< 2 27J ', as is required in (3.2). 

For n < —fJ>j, we need a more involved argument. We encode some of the necessary combi- 
natorics into this Lemma. 

4.6. Lemma. For n < —fj,j there is a set E n C K |i? n | z< 2™ so £na£ £ne collection 
Sfa.t(n) = {s G ^^(n) : I s <f_ E n } is a union of collections C/f at (n, fc), 1 < k < — 500n 7 which 
satisfies these properties. For each 1 < k < — 500n 7 

(i) U{gx(n,k) is uniquely decomposable into maximal disjoint trees T fat G Tf at (n, k). 

( n ) IIZ)r fct eT fct (n,fc) 1 ^ fat lloo di 2 10 ". 

fmj {2 _£n 7r fat x u;T fat : Tfat G Tf at (n, A;)} are pairwise disjoint rectangles, 

(iv) For all s G t/f at (n, fc) 

I s t |J : ^,J Tfat )<2 10 "|/ Tfat |}. 

7f at eT fat (n,fe) 

^ Either T fat = {J Tfat x ^T fat } /or a// T fat G T fat (n, k) or 7 Tfat x o; Tfat G" T fat /or a// 
T fa t G Tfat (n, A;). 

fwj 7/s,s' G C/ fat (ra, fc) U UUr fat * ^T fat : 7f at G T fat } and |/ s | < \I S ,\ then 2~ 200n \I s \ < \I S ,\. 

We do not estimate Mf on the set F\ = [L^_„,E n . As this set has measure IFJ ^ 2 _w ^ 
2~ 27j ', there is no harm in doing this. 

Off of this set, our lemma permits the following construction. For all n < —fij, < k < 
— 500n and Tf at G T fat (n, k) there is a functions iV Tfat for which 

(4.7) \N T ^ (x) - Mj fat (x) | r< 2 10n x/ Tfat (a;) 100 , iGK, 
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N T ^ 

Tf at 6T fat (n,fe) 



(4.8) The functions iV Tfat are disjointly supported in T fat G T fat (n, k). 

But then we can estimate by (4.4) 

""= E ii? 

Ti at eT (at (n,k) 
_<; jP2 n P+( 3 ~ 5 /p)Pi |/ Tf | 

7f at eT fat (n,fc) 

-<; j527j/4+n/4 

Thus certainly 

| j^t/f a tO,fc) > 2 n / 18 }| -<; 2 7 J/ 4 + n / 8 _ 
This is summable over n < —fij and < k < — 500n and so completes our proof of (3.2). 

[This interplay between L 2 and L p estimates is due to C. Thiele [11] and contrasts with the 
argument of Lacey and Thiele [5] . The latter paper uses two notions "energy" (the current 
"size") and "mass", which are in some sense dual to one another. The notion of "mass" 
seems to have little utility in this paper: "Mass" can be exploited through devices linked to 
the Hardy-Littlewood maximal function, but our kernels bear no close connection to that 
maximal function.! 



The construction relies on an argument from [6]. Fix n, k, set £/f at := Ufa t (n, k) and T fat := 
T fat (n, k). To each s G C/f at we construct a set G s as follows. Recall (v ) from the Lemma. If 
each Tf at G Tf at consists only of a top we set G s = 2~ £n I s where s is the top of the tree and 
jyT fat = l Gs X)aethin( s )(/' ( M < /V Then (4.8) follows from (Hi) and (4.7) follows from (4.5). 

We thus assume that no tree T fat G T fat contains its top. We then make the following 
definitions for s G T fat . 



'T fat 



- u ^ 



T'eT !at (s) 



T fat (s) := {Ti t G T fat - {T fat } : u al D uj tL , I tL C I T( J, 



crGthin(s) 



We verify (4.8). Since the support of <p a is in {x : N(x) G ou sl }, (4.8) is a consequence of 
the observation that if G s x u sl n Gv x o; s 'i 7^ then s and s' are in the same tree. Indeed 
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I T i X L0 T > 

± fat ± fat 



^Tfat >< W Tfat 



Figure 2: The top of the tree T fat does not intersect the set G s since the intervals I<r fat and i^y 
intersect. 

write s G T fat and s' G T/ at and assume say u s >i C cj s1 . If u s >i = u sl and the two trees 
are distinct then 7 Tfat and 7 Tf ' are disjoint by (i). Assume ou s >i uj s \ and G s fl G s i ^ 0. 
Then 00^,00^^ C c<j s1 . See figure 2. The tops ir fat and 7 Tf ' must intersect. Assuming 
Jjiat C ir f ' then s < x o; Tf / . But then condition (i) forces Tf at = T/ at . Thus we must 
have C ir fat , which by definition means that fl G s = 0, so that G s fl G s > = 0. This 
is a contradiction and so proves (4.8). 

We verify (4.7). In the case of x ^ It {sX this follows from (4.5) and conditions (iv) and (vi) 
of lemma 4.6. We do not comment further. For x G ir fat we in fact have N Tia - % (x) = Mj fat (x) 
unless x G ir/ at and T/ at G Tthin(s) f° r some s G Tf at . Indeed, with Tf at fixed we can assume 
that Try C ir fat for all T fat G T fat . Then we shall just reverse the order of summation below. 

\N T ^(x)-Mj^(x)\ < E K/,<M<^(z)|, 

T L eTfat -{T fat } <xGthin(T fat (T/ at )) 

where T fat (T/ at ) := {s G T fat : T/ at G T fat (s)} and T fat (s) was used to define G s . But again 
condition (iv) and (vi) imply that 

E l(/,i>^WI^»i T ,(x) 

aethin(T fat (T/ at )) 

and (4.7) follows from condition (ii). 
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Our proof of (3.2) is complete modulo the proofs of the lemmas, which are taken up in the 
next section. 



5 Proofs of the Lemmas 

Proof of lemma 4-1- The argument is a variant of one in [4] and has been used several times 
since. We give the details, although only small changes are needed to account for the disparity 
between fat and thin tiles. The most expedient treatment requires a new definition of a tree. 

Fix a choice of integer < k < 200 j. For a 1-tree T thin call a subset T/ hin C T thin a left-tree 
(right-tree) if there is a C,r thin £ w T th in with £r fat to the left (right) of every u s , s G T/ hin . In 
addition require that for all s G T/ hin U {^ h . x u t^. }> log 2 1 7 S | G k + 200jZ. Define "the left 
size of Sf at ," or £-size(Sf at ) as 

£-size(Sf at ) = sup{[|/ Tfat r 1 K/>^)| 2 ] 1/2 } 
where the supremum is over all left-trees T f ^ t with T fat C «Sf at . 

We prove this statement. For any finite Sf at C Pf at set e = £-size(S). Then S iat = S\ Q U S hi 
with £-size(5'i ) < e/A and Shi is a union of trees T fat C T fat with 

(5-1) E 1^1 =<e" 2 . 

Tf a t eT fat 

An inductive application of this statement proves lemma 4.1 with size(Sf at ) replaced by 
£-size(Sf at ). The factor j does not enter into this statement of the lemma. The same 
statement is true for right-size. Letting k vary from to 200j proves the Lemma as stated. 

The construction of S hi and T fat is inductive. The construction also associates to each 
Tfat G T fat a particular left-tree T t £ hin which are used to prove (5.1). Initially set S stock := S fat . 
Select a tree T fat C S stock so that 

(a) Tf at contains a left-tree T/ hin with 

E l(/,^>| 2 >(4 £ )- 2 |/T f J- 
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(b) 7 Tfat is maximal amoung trees satisfying condition (a) and T fat is the maximal tree in 
^stock with that top 

(c) £ T t is right-most amoung trees satisfying (a) and (b). 

Then add T fat to T fat , set S stock := S stock — T fat . Repeat this procedure until there is no tree 
satisfying (a). Then set S lo := S stock . By definition, £ - size(S lo ) < e/4. 

The left-trees we have constructed satisfy this disjointness property. For T fat ^ T/ at G T fat 
and a G T/ hin and </ G T/ Wn 

(5.2) if w CT uj a i then 7 Tfat n 7 CT / = 0. 

Indeed, w-r fat C uv so that < £ Tfat . Thus the tree Tf at was constructed before T fat . But 
if 7 Tfat H 7 S / 7^ we see that s' < 7r fat x o; Tfat where a G thin(s'). Hence s' G T fat which is a 
contradiction. See figure 3. 







s 




s' 




s" 




















7r fat x w Tfat 






fat fat 







Figure 3: By the manner in which the trees are constructed, the tree 7f at was constructed 
before T fat . Hence the tile s' must be in T fat . But the tile s" is a member T/ at . 

Let T red be those a G 7^ hin for which if |7 CT | < |7 T red| then dist(7 CT , 97 T red) > ^|7 Tre d|. ["red" 
is for "reduced." Note that the top is permitted to be in T red . And that if \I a \ < |7 TrC d| then 
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\I a \ is in fact much smaller than |/ T red l-] As £-size(S , fat ) < £, it follows that 

E i</><Mr>f^n 

trGT red 



Set S thin = U TfateTfat T red . And 

5:= I E (/^-)^ 

Observe that by Cauchy-Schwartz and ||/|| 2 = 1, 

^ E l^l< E I I" 
TfatGTf a t ^eSthin 

V, E (/»^>^ 

(5.3) < 5. 

To conclude (5.1) we show that 

(5-4) £ 2 ^ 2 E 1^1- 

7f at eTf at 



By expanding the L 2 norm B 2 < 2{B\ + B 2 ) where we define 

B )--= E </>^> E (<^<M(</v,/) 

S lhin( a ) '■= W e S'thin : = C<v}» &wJ<<>) '■= W € ^thin = U c UJ a >}. 

Note that if u a C we have 



(5.5) 



100 



To bound S x fix a dyadic interval c<j. This last estimate and Cauchy-Schwartz estimate 
shows that 

| E </>^> E (foMifaf) * E k/»^>i 2 - 
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Hence by (5.3) and summing over u, 

Bi^ E \(fM\ 2 u E i^j- 

ceSthin 7f a teTf at 

as £-size(S'thin) = £■ This is the first step in establishing (5.4). 

To control B\ we must use the disjointness property (5.2). Fix a tree T red and consider 
a G T red . Then the intervals {I a > : a' G Sthin(s)} are pairwise disjoint and contained in 
(ir fat ) c - To see this note that for all a', a" G Sthin(s) we have u a C c<v flov/. So (5.2) implies 
I a > fl Jo-// = 0. Then we can estimate 

E l</.^><^^X^,/>|de 2 E X/ CT (c(/ CT 0) 1O °l^l 

O"'eS t hin0) CT'65 th in(s) 




Here, we have in addition relied upon the estimate |(/, < e-^/l^l- Finally, the estimate 
below follows as J CT is both much smaller than I™ d and not close to the boundary of J^ ed . 
This completes the proof of (5.4). 

E / x/ CT (*) 90 ^|/ Tf j. 



□ 



Proof of lemma 4-3. We begin by verifying (4.5). For any a G P t hin and m > observe that 

(5.6) \<P*(x)\±\I*\- 1/2 XiAx) m , xeR. 

Indeed, after taking dilation and translation into account this estimate reduces to 

2^ 2 J + £ ) e (£ 2 + £y) <%\ ± 2^/ 2 (l + 2%|)""\ y G R. 

Here, tjj is a Schwarz function supported in | < |£| < 2 and 2- 7-1 < £ < 2 j+1 . But then 
at most ^ 1 oscillations of e(£ 2 ) are relevant to the integral, so the estimate follows by a 
repeated integration by parts. Then (5.6) plus a routine argument proves (4.5). 
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Turning to the estimate (4.4), note that any tree T t hi n C Pthm is a union of ^ j 1 and 
2 — trees. It suffices to prove (4.4) without the leading factor of j on the right for 1 and 
2 — trees. 

We consider first the case of a 2-tree T t hin- In this case, the sets lj s i for s G T thin are disjoint 
and for a ^ a' G T t hi n , we have either \I C \ ^ \Ia'\, m which case <p a and </v are disjointly 
supported, or l a fl l a > = 0, in which case we relie upon the decay (5.6). Thus, 

\Mj^(x)\±size(T thhi )xi Tth Jx) s . 

That is, (4.4) is trivially satisfied in this case. [This argument is the key motivation for the 
definitions of Pf at and P t hin-] 

We now turn to the case of a 1 — tree T t hm- A specific case unlocks the general case. Suppose 
that T thin is a tree with \I a \ = \I a i\ for all a, a' G T thin and = c(cu a; i). Then from (2.9) we 
have for all 2 < p < oo, 

||M T -i P < \\Dj( 

seT thin 

^ 2 .(i-iM|| j2 (f,<PM4 P 

o"eT thin 

(5.7) ^2^ 1 - 1 ^ +2 ^ 1 - 2 ^size(T thin )|/ rthin | 1 ^. 

The last line follows as there are 2 2j tiles a in any thin(s) for s G Pfat- Combine this with a 
trivial interpolation argument to conclude this case. 

More generally, for any tree T t h in , observe that there is a connection to the space of functions 
of bounded mean oscillation. The distinction between fat and thin tiles must enter into this 
relationship however. And in particular it is 

(5-8) || £ (/,0 CT )0 (T || BMO ^2^size(T thm ). 

This follows from the definition of size. 

Let ou TtMn = uj 2 ■ • • be the maximal sequence of dyadic intervals containing c<>r thin - 

Let T t ' hin = {s G T thin : uo s = uo 1 }. Then the functions M T ^ are disjointly supported in /. 
Hence, 

\\M T ^f p = ^||M T H|£ 
i 
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p 
p 

2~\ 1/2 



1 ^GT t ! Mn 



^ 2 M3-5/ P ) size(Tth . n) | /Tth; jl/ P> 

Here we relie on p > 2, (5.8) and(5.7). □ 

Proof of lemma 4-6. We shall show that there is a set E n C K so that |-E n | ^ 2 n and the 
collection Sf at (n) := {s G Sf at (n) : J s G: E n } is a union of collections U{ a t(n,k), < k < 
— 50n satisfying (i) — (in). 

The last three conditions of the Lemma are trivially satisfied by making further subdivisions 
of the subcollections £/f at (n, A;), and making a small further contribution to the exceptional 
set E n . Thus, the lemma will follow in complete generality. 

Fix n and set Sf at = Sf a t(n). Condition (ii) is also easy to satisfy. For the first contribution 
to our exceptional set, define 



E 1 : {.r : M*) > 2_1 

xes* 



J fat 



where S^ t consists of the maximal elements of Sf at . By (4.2) \E l \ -< 2 n . We can assume that 
for all s G Sf at , I s (£ E 1 . Then certainly (ii) is true. 

We now show that Sf at is decomposable into subcollections f/f at (/c), 1 < k < — lOn which are 
uniquely decomposable into maximal trees. This last condition is true iff to each s G U{ a t(k) 
there is a unique maximal s* G U^ t with s < s*. And this is so iff the collections U{ at (k) 
does not admit a vee in the partial order on tiles. A vee is three tiles s, s', s" with s < s', s" 
but s' and s" are not comparable with respect to the partial order on tiles. 

To acheive this, we employ a method of Fefferman [2]. Define a counting function 

C(s) := #K G SI, : s < s*}. 

Then C(s) < 2~ 10n for all s G S iat as (ii) is true. Take the sets U iat (k) to be {s : 2 fe_1 < 
C(s) < 2 k }. 
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That these sets do not contain vees follows immediately from the observation that C(s) is 
superadditive in this sense. If s,s',s" G Sf at is a vee, then C(s) > C(s') + C(s"). [Then if 
s',s" e U{ at (k) we see that C(s) > 2 fe_1 + 2 fc_1 = 2 fe , so it can not be in Uf at (k).] Indeed, 
there there can be no maximal tile s'" larger than both s' and s", for this would force s' and 
s" to be comparable in the partial order, as one checks immediately. Hence the maximal 
tiles greater than s' are disjoint from those greater than s", which proves the superadditivity 
property. 

The last condition to verify is (in), which requires another class of contributions to the 
exceptional set. Fix a choice of 1 < k < — lOn. Consider the maximal tiles U£ at (k). We 
want to separate these tiles after expanding the coordinates I s by a factor of 2 _en . This can 
be done, up to an exceptional set and a further division of Uf at (k), by applying Lemma 4.4 
to S = E/£ t , with A = T n . The details are omitted. 

□ 
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